Covariance And Contravariance Of Vectors
   HOME

TheInfoList



OR:

In physics, especially in
multilinear algebra Multilinear algebra is a subfield of mathematics that extends the methods of linear algebra. Just as linear algebra is built on the concept of a vector and develops the theory of vector spaces, multilinear algebra builds on the concepts of ''p' ...
and
tensor analysis In mathematics and physics, a tensor field assigns a tensor to each point of a mathematical space (typically a Euclidean space or manifold). Tensor fields are used in differential geometry, algebraic geometry, general relativity, in the analysis ...
, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a
change of basis In mathematics, an ordered basis of a vector space of finite dimension allows representing uniquely any element of the vector space by a coordinate vector, which is a sequence of scalars called coordinates. If two different bases are consider ...
. In modern mathematical notation, the role is sometimes swapped. In physics, a basis is sometimes thought of as a set of reference axes. A change of scale on the reference axes corresponds to a change of units in the problem. For instance, by changing scale from meters to centimeters (that is, ''dividing'' the scale of the reference axes by 100), the components of a measured
velocity Velocity is the directional speed of an object in motion as an indication of its rate of change in position as observed from a particular frame of reference and as measured by a particular standard of time (e.g. northbound). Velocity i ...
vector Vector most often refers to: *Euclidean vector, a quantity with a magnitude and a direction *Vector (epidemiology), an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematic ...
are ''multiplied'' by 100. A vector changes scale ''inversely'' to changes in scale to the reference axes, and consequently is called ''contravariant''. As a result, a vector often has units of distance or distance with other units (as, for example, velocity has units of distance divided by time). In contrast, a covector, also called a ''dual vector'', typically has units of the inverse of distance or the inverse of distance with other units. For example, a
gradient In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p is the "direction and rate of fastest increase". If the gr ...
which has units of a spatial
derivative In mathematics, the derivative of a function of a real variable measures the sensitivity to change of the function value (output value) with respect to a change in its argument (input value). Derivatives are a fundamental tool of calculus. ...
, or distance−1. The components of a covector changes in the ''same way'' as changes to scale of the reference axes, and consequently is called ''covariant''. A third concept related to covariance and contravariance is invariance. An example of a physical
observable In physics, an observable is a physical quantity that can be measured. Examples include position and momentum. In systems governed by classical mechanics, it is a real-valued "function" on the set of all possible system states. In quantum phy ...
that does not change with a change of scale on the reference axes is the
mass Mass is an intrinsic property of a body. It was traditionally believed to be related to the quantity of matter in a physical body, until the discovery of the atom and particle physics. It was found that different atoms and different eleme ...
of a particle, which has units of mass (that is, no units of distance). The single,
scalar Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers * Scalar (physics), a physical quantity that can be described by a single element of a number field such ...
value of mass is independent of changes to the scale of the reference axes and consequently is called ''invariant''. Under more general changes in basis: * A contravariant vector or
tangent vector In mathematics, a tangent vector is a vector that is tangent to a curve or surface at a given point. Tangent vectors are described in the differential geometry of curves in the context of curves in R''n''. More generally, tangent vectors are e ...
(often abbreviated simply as ''vector'', such as a
direction vector In mathematics, a unit vector in a normed vector space is a vector (often a spatial vector) of length 1. A unit vector is often denoted by a lowercase letter with a circumflex, or "hat", as in \hat (pronounced "v-hat"). The term ''direction ve ...
or velocity vector) has components that ''contra-vary'' with a change of basis to compensate. That is, the matrix that transforms the vector components must be the inverse of the matrix that transforms the basis vectors. The components of vectors (as opposed to those of covectors) are said to be contravariant. Examples of vectors with ''contravariant components'' include the position of an object relative to an observer, or any derivative of position with respect to time, including velocity,
acceleration In mechanics, acceleration is the rate of change of the velocity of an object with respect to time. Accelerations are vector quantities (in that they have magnitude and direction). The orientation of an object's acceleration is given by t ...
, and jerk. In Einstein notation (implicit summation over repeated index), contravariant components are denoted with ''upper indices'' as in *:\mathbf = v^i \mathbf_i * A covariant vector or
cotangent vector In differential geometry, the cotangent space is a vector space associated with a point x on a smooth (or differentiable) manifold \mathcal M; one can define a cotangent space for every point on a smooth manifold. Typically, the cotangent space, T ...
(often abbreviated as ''covector'') has components that ''co-vary'' with a change of basis. That is, the components must be transformed by the same matrix as the change of basis matrix. The components of covectors (as opposed to those of vectors) are said to be covariant. Examples of covariant vectors generally appear when taking a
gradient In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p is the "direction and rate of fastest increase". If the gr ...
of a function. In Einstein notation, covariant components are denoted with ''lower indices'' as in *:\mathbf = w_i \mathbf^i.
Curvilinear coordinate system In geometry, curvilinear coordinates are a coordinate system for Euclidean space in which the coordinate lines may be curved. These coordinates may be derived from a set of Cartesian coordinates by using a transformation that is locally inv ...
s, such as cylindrical or
spherical coordinates In mathematics, a spherical coordinate system is a coordinate system for three-dimensional space where the position of a point is specified by three numbers: the ''radial distance'' of that point from a fixed origin, its ''polar angle'' meas ...
, are often used in physical and geometric problems. Associated with any coordinate system is a natural choice of coordinate basis for vectors based at each point of the space, and covariance and contravariance are particularly important for understanding how the coordinate description of a vector changes by passing from one coordinate system to another. The terms ''covariant'' and ''contravariant'' were introduced by James Joseph Sylvester in 1851 in the context of associated algebraic forms theory.
Tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tensor ...
s are objects in
multilinear algebra Multilinear algebra is a subfield of mathematics that extends the methods of linear algebra. Just as linear algebra is built on the concept of a vector and develops the theory of vector spaces, multilinear algebra builds on the concepts of ''p' ...
that can have aspects of both covariance and contravariance. In the lexicon of category theory, covariance and contravariance are properties of
functor In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) are associated to topological spaces, and m ...
s; unfortunately, it is the lower-index objects (covectors) that generically have
pullback In mathematics, a pullback is either of two different, but related processes: precomposition and fiber-product. Its dual is a pushforward. Precomposition Precomposition with a function probably provides the most elementary notion of pullback: i ...
s, which are contravariant, while the upper-index objects (vectors) instead have
pushforward The notion of pushforward in mathematics is "dual" to the notion of pullback, and can mean a number of different but closely related things. * Pushforward (differential), the differential of a smooth map between manifolds, and the "pushforward" op ...
s, which are covariant. This terminological conflict may be avoided by calling contravariant functors "cofunctors"—in accord with the "covector" terminology, and continuing the tradition of treating vectors as the concept and covectors as the coconcept.


Introduction

In physics, a vector typically arises as the outcome of a measurement or series of measurements, and is represented as a list (or
tuple In mathematics, a tuple is a finite ordered list (sequence) of elements. An -tuple is a sequence (or ordered list) of elements, where is a non-negative integer. There is only one 0-tuple, referred to as ''the empty tuple''. An -tuple is defi ...
) of numbers such as :(v_1,v_2,v_3) . The numbers in the list depend on the choice of coordinate system. For instance, if the vector represents position with respect to an observer ( position vector), then the coordinate system may be obtained from a system of rigid rods, or reference axes, along which the components ''v''1, ''v''2, and ''v''3 are measured. For a vector to represent a geometric object, it must be possible to describe how it looks in any other coordinate system. That is to say, the components of the vectors will ''transform'' in a certain way in passing from one coordinate system to another. A ''contravariant vector'' has components that "transform as the coordinates do" under changes of coordinates (and so inversely to the transformation of the reference axes), including rotation and dilation. The vector itself does not change under these operations; instead, the components of the vector change in a way that cancels the change in the spatial axes, in the same way that coordinates change. In other words, if the reference axes were rotated in one direction, the component representation of the vector would rotate in exactly the opposite way. Similarly, if the reference axes were stretched in one direction, the components of the vector, like the coordinates, would reduce in an exactly compensating way. Mathematically, if the coordinate system undergoes a transformation described by an
invertible matrix In linear algebra, an -by- square matrix is called invertible (also nonsingular or nondegenerate), if there exists an -by- square matrix such that :\mathbf = \mathbf = \mathbf_n \ where denotes the -by- identity matrix and the multiplicati ...
''M'', so that a coordinate vector x is transformed to \mathbf'=M\mathbf, then a contravariant vector v must be similarly transformed via \mathbf'=M\mathbf. This important requirement is what distinguishes a contravariant vector from any other triple of physically meaningful quantities. For example, if ''v'' consists of the ''x''-, ''y''-, and ''z''-components of
velocity Velocity is the directional speed of an object in motion as an indication of its rate of change in position as observed from a particular frame of reference and as measured by a particular standard of time (e.g. northbound). Velocity i ...
, then ''v'' is a contravariant vector: if the coordinates of space are stretched, rotated, or twisted, then the components of the velocity transform in the same way. Examples of contravariant vectors include
position Position often refers to: * Position (geometry), the spatial location (rather than orientation) of an entity * Position, a job or occupation Position may also refer to: Games and recreation * Position (poker), location relative to the dealer * ...
,
displacement Displacement may refer to: Physical sciences Mathematics and Physics * Displacement (geometry), is the difference between the final and initial position of a point trajectory (for instance, the center of mass of a moving object). The actual path ...
,
velocity Velocity is the directional speed of an object in motion as an indication of its rate of change in position as observed from a particular frame of reference and as measured by a particular standard of time (e.g. northbound). Velocity i ...
,
acceleration In mechanics, acceleration is the rate of change of the velocity of an object with respect to time. Accelerations are vector quantities (in that they have magnitude and direction). The orientation of an object's acceleration is given by t ...
, momentum, and force. By contrast, a ''covariant vector'' has components that change oppositely to the coordinates or, equivalently, transform like the reference axes. For instance, the components of the
gradient In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field (or vector-valued function) \nabla f whose value at a point p is the "direction and rate of fastest increase". If the gr ...
vector of a function :\nabla f = \frac\widehat^1+\frac\widehat^2+\frac\widehat^3 transform like the reference axes themselves.


Definition

The general formulation of covariance and contravariance refer to how the components of a coordinate vector transform under a
change of basis In mathematics, an ordered basis of a vector space of finite dimension allows representing uniquely any element of the vector space by a coordinate vector, which is a sequence of scalars called coordinates. If two different bases are consider ...
( passive transformation). Thus let ''V'' be a
vector space In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called ''scalars''. Scalars are often real numbers, but can ...
of dimension ''n'' over a
field Field may refer to: Expanses of open ground * Field (agriculture), an area of land used for agricultural purposes * Airfield, an aerodrome that lacks the infrastructure of an airport * Battlefield * Lawn, an area of mowed grass * Meadow, a grass ...
of scalars ''S'', and let each of and be a
basis Basis may refer to: Finance and accounting * Adjusted basis, the net cost of an asset after adjusting for various tax-related items *Basis point, 0.01%, often used in the context of interest rates * Basis trading, a trading strategy consisting ...
of ''V''.A basis f may here profitably be viewed as a
linear isomorphism In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pre ...
from R''n'' to ''V''. Regarding f as a row vector whose entries are the elements of the basis, the associated linear isomorphism is then \mathbf\mapsto \mathbf\mathbf.
Also, let the
change of basis In mathematics, an ordered basis of a vector space of finite dimension allows representing uniquely any element of the vector space by a coordinate vector, which is a sequence of scalars called coordinates. If two different bases are consider ...
from f to f′ be given by for some
invertible In mathematics, the concept of an inverse element generalises the concepts of opposite () and reciprocal () of numbers. Given an operation denoted here , and an identity element denoted , if , one says that is a left inverse of , and that is ...
''n''×''n'' matrix ''A'' with entries a^i_j. Here, each vector ''Y''''j'' of the f′ basis is a linear combination of the vectors ''X''''i'' of the f basis, so that :Y_j=\sum_i a^i_jX_i.


Contravariant transformation

A vector v in ''V'' is expressed uniquely as a linear combination of the elements X_i of the f basis as where ''v'' ''fare elements of the field ''S'' known as the components of ''v'' in the f basis. Denote the
column vector In linear algebra, a column vector with m elements is an m \times 1 matrix consisting of a single column of m entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some n, c ...
of components of ''v'' by v ''f :\mathbf mathbf= \beginv^1 mathbf\v^2 mathbf\\vdots\\v^n mathbfend so that () can be rewritten as a matrix product :v = \mathbf\, \mathbf mathbf The vector ''v'' may also be expressed in terms of the f′ basis, so that :v = \mathbf\, \mathbf mathbf However, since the vector ''v'' itself is invariant under the choice of basis, :\mathbf\, \mathbf mathbf= v = \mathbf\, \mathbf mathbf The invariance of ''v'' combined with the relationship () between f and f′ implies that :\mathbf\, \mathbf mathbf= \mathbfA\, \mathbf mathbfA giving the transformation rule :\mathbf mathbf= \mathbf mathbfA= A^\mathbf mathbf In terms of components, :v^i mathbfA= \sum_j \tilde^i_jv^j mathbf/math> where the coefficients \tilde^i_j are the entries of the
inverse matrix In linear algebra, an -by- square matrix is called invertible (also nonsingular or nondegenerate), if there exists an -by- square matrix such that :\mathbf = \mathbf = \mathbf_n \ where denotes the -by- identity matrix and the multiplicati ...
of ''A''. Because the components of the vector ''v'' transform with the ''inverse'' of the matrix ''A'', these components are said to transform contravariantly under a change of basis. The way ''A'' relates the two pairs is depicted in the following informal diagram using an arrow. The reversal of the arrow indicates a contravariant change: :\begin \mathbf &\longrightarrow \mathbf \\ v mathbf&\longleftarrow v mathbf\end


Covariant transformation

A
linear functional In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear map from a vector space to its field of scalars (often, the real numbers or the complex numbers). If is a vector space over a field , the ...
''α'' on ''V'' is expressed uniquely in terms of its components (elements in ''S'') in the f basis as :\alpha(X_i) = \alpha_i mathbf, \quad i=1,2,\dots,n. These components are the action of ''α'' on the basis vectors ''X''''i'' of the f basis. Under the change of basis from f to f′ (), the components transform so that Denote the
row vector In linear algebra, a column vector with m elements is an m \times 1 matrix consisting of a single column of m entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some n, c ...
of components of ''α'' by ''α'' ''f :\mathbf mathbf= \begin\alpha_1 mathbf\alpha_2 mathbf\dots,\alpha_n mathbfend so that () can be rewritten as the matrix product :\alpha mathbfA= \alpha mathbf. Because the components of the linear functional α transform with the matrix ''A'', these components are said to transform covariantly under a change of basis. The way ''A'' relates the two pairs is depicted in the following informal diagram using an arrow. A covariant relationship is indicated since the arrows travel in the same direction: :\begin \mathbf &\longrightarrow \mathbf \\ \alpha mathbf&\longrightarrow \alpha mathbf\end Had a column vector representation been used instead, the transformation law would be the
transpose In linear algebra, the transpose of a matrix is an operator which flips a matrix over its diagonal; that is, it switches the row and column indices of the matrix by producing another matrix, often denoted by (among other notations). The tr ...
:\alpha^\mathrm mathbfA= A^\mathrm\alpha^\mathrm mathbf


Coordinates

The choice of basis f on the vector space ''V'' defines uniquely a set of coordinate functions on ''V'', by means of :x^i mathbfv) = v^i mathbf The coordinates on ''V'' are therefore contravariant in the sense that :x^i mathbfA= \sum_^n \tilde^i_kx^k mathbf Conversely, a system of ''n'' quantities ''v''''i'' that transform like the coordinates ''x''''i'' on ''V'' defines a contravariant vector. A system of ''n'' quantities that transform oppositely to the coordinates is then a covariant vector. This formulation of contravariance and covariance is often more natural in applications in which there is a coordinate space (a manifold) on which vectors live as
tangent vector In mathematics, a tangent vector is a vector that is tangent to a curve or surface at a given point. Tangent vectors are described in the differential geometry of curves in the context of curves in R''n''. More generally, tangent vectors are e ...
s or
cotangent vector In differential geometry, the cotangent space is a vector space associated with a point x on a smooth (or differentiable) manifold \mathcal M; one can define a cotangent space for every point on a smooth manifold. Typically, the cotangent space, T ...
s. Given a local coordinate system ''x''''i'' on the manifold, the reference axes for the coordinate system are the vector fields :X_1 = \frac,\dots,X_n=\frac. This gives rise to the frame at every point of the coordinate patch. If ''y''''i'' is a different coordinate system and :Y_1=\frac,\dots,Y_n = \frac, then the frame f' is related to the frame f by the inverse of the Jacobian matrix of the coordinate transition: :\mathbf' = \mathbfJ^,\quad J = \left(\frac\right)_^n. Or, in indices, :\frac = \sum_^n\frac\frac. A tangent vector is by definition a vector that is a linear combination of the coordinate partials \partial/\partial x^i. Thus a tangent vector is defined by :v = \sum_^n v^i mathbfX_i = \mathbf\ \mathbf mathbf Such a vector is contravariant with respect to change of frame. Under changes in the coordinate system, one has :\mathbf\left mathbf'\right= \mathbf\left mathbfJ^\right= J\, \mathbf mathbf Therefore, the components of a tangent vector transform via :v^i\left mathbf'\right= \sum_^n \fracv^j mathbf Accordingly, a system of ''n'' quantities ''v''''i'' depending on the coordinates that transform in this way on passing from one coordinate system to another is called a contravariant vector.


Covariant and contravariant components of a vector with a metric

In a finite-dimensional
vector space In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called ''scalars''. Scalars are often real numbers, but can ...
''V'' over a field ''K'' with a symmetric bilinear form (which may be referred to as the metric tensor), there is little distinction between covariant and contravariant vectors, because the bilinear form allows covectors to be identified with vectors. That is, a vector ''v'' uniquely determines a covector ''α'' via :\alpha(w) = g(v, w) for all vectors ''w''. Conversely, each covector ''α'' determines a unique vector ''v'' by this equation. Because of this identification of vectors with covectors, one may speak of the covariant components or contravariant components of a vector, that is, they are just representations of the same vector using the reciprocal basis. Given a basis of ''V'', there is a unique reciprocal basis of ''V'' determined by requiring that :Y^i(X_j) = \delta^i_j, the
Kronecker delta In mathematics, the Kronecker delta (named after Leopold Kronecker) is a function of two variables, usually just non-negative integers. The function is 1 if the variables are equal, and 0 otherwise: \delta_ = \begin 0 &\text i \neq j, \\ 1 & ...
. In terms of these bases, any vector ''v'' can be written in two ways: :\begin v &= \sum_i v^i mathbf_i = \mathbf\,\mathbf mathbf\ &=\sum_i v_i mathbf^i = \mathbf^\sharp\mathbf^\sharp mathbf \end The components ''v''''i'' ''fare the contravariant components of the vector ''v'' in the basis f, and the components ''v''''i'' ''fare the covariant components of ''v'' in the basis f. The terminology is justified because under a change of basis, :\mathbf mathbfA= A^\mathbf mathbf\quad \mathbf^\sharp mathbfA= A^T\mathbf^\sharp mathbf


Euclidean plane

In the Euclidean plane, the
dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used sometimes for other symmetric bilinear forms, for example in a pseudo-Euclidean space. is an alge ...
allows for vectors to be identified with covectors. If \mathbf_1,\mathbf_2 is a basis, then the dual basis \mathbf^1,\mathbf^2 satisfies :\begin \mathbf^1\cdot\mathbf_1 = 1, &\quad \mathbf^1\cdot\mathbf_2 = 0 \\ \mathbf^2\cdot\mathbf_1 = 0, &\quad \mathbf^2\cdot\mathbf_2 = 1. \end Thus, e1 and e2 are perpendicular to each other, as are e2 and e1, and the lengths of e1 and e2 normalized against e1 and e2, respectively.


Example

For example, suppose that we are given a basis e1, e2 consisting of a pair of vectors making a 45° angle with one another, such that e1 has length 2 and e2 has length 1. Then the dual basis vectors are given as follows: * e2 is the result of rotating e1 through an angle of 90° (where the sense is measured by assuming the pair e1, e2 to be positively oriented), and then rescaling so that holds. * e1 is the result of rotating e2 through an angle of 90°, and then rescaling so that holds. Applying these rules, we find :\mathbf^1 = \frac\mathbf_1 - \frac\mathbf_2 and :\mathbf^2 = -\frac\mathbf_1 + 2\mathbf_2. Thus the change of basis matrix in going from the original basis to the reciprocal basis is :R = \begin \frac & -\frac \\ -\frac & 2 \end, since : mathbf^1\ \mathbf^2= mathbf_1\ \mathbf_2begin \frac & -\frac \\ -\frac & 2 \end. For instance, the vector :v = \frac\mathbf_1 + 2\mathbf_2 is a vector with contravariant components :v^1 = \frac,\quad v^2 = 2. The covariant components are obtained by equating the two expressions for the vector ''v'': :v = v_1\mathbf^1 + v_2\mathbf^2 = v^1\mathbf_1 + v^2\mathbf_2 so :\begin \beginv_1\\ v_2\end &= R^\beginv^1 \\ v^2\end \\ &= \begin4 & \sqrt \\ \sqrt & 1\end \beginv^1 \\ v^2\end \\ &= \begin6 + 2\sqrt \\ 2 + \frac\end \end.


Three-dimensional Euclidean space

In the three-dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are Euclidean ...
, one can also determine explicitly the dual basis to a given set of
basis vector In mathematics, a set of vectors in a vector space is called a basis if every element of may be written in a unique way as a finite linear combination of elements of . The coefficients of this linear combination are referred to as components ...
s e1, e2, e3 of ''E''3 that are not necessarily assumed to be orthogonal nor of unit norm. The dual basis vectors are: : \mathbf^1 = \frac ; \qquad \mathbf^2 = \frac; \qquad \mathbf^3 = \frac. Even when the ei and ei are not
orthonormal In linear algebra, two vectors in an inner product space are orthonormal if they are orthogonal (or perpendicular along a line) unit vectors. A set of vectors form an orthonormal set if all vectors in the set are mutually orthogonal and all of un ...
, they are still mutually reciprocal: :\mathbf^i \cdot \mathbf_j = \delta^i_j, Then the contravariant components of any vector v can be obtained by the
dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used sometimes for other symmetric bilinear forms, for example in a pseudo-Euclidean space. is an alge ...
of v with the dual basis vectors: : q^1 = \mathbf \cdot \mathbf^1; \qquad q^2 = \mathbf \cdot \mathbf^2; \qquad q^3 = \mathbf \cdot \mathbf^3 . Likewise, the covariant components of v can be obtained from the dot product of v with basis vectors, viz. : q_1 = \mathbf \cdot \mathbf_1; \qquad q_2 = \mathbf \cdot \mathbf_2; \qquad q_3 = \mathbf \cdot \mathbf_3 . Then v can be expressed in two (reciprocal) ways, viz. : \mathbf = q^i \mathbf_i = q^1 \mathbf_1 + q^2 \mathbf_2 + q^3 \mathbf_3 . or : \mathbf = q_i \mathbf^i = q_1 \mathbf^1 + q_2 \mathbf^2 + q_3 \mathbf^3 Combining the above relations, we have : \mathbf = (\mathbf \cdot \mathbf^i) \mathbf_i = (\mathbf \cdot \mathbf_i) \mathbf^i and we can convert between the basis and dual basis with :q_i = \mathbf\cdot \mathbf_i = (q^j \mathbf_j)\cdot \mathbf_i = (\mathbf_j\cdot\mathbf_i) q^j and :q^i = \mathbf\cdot \mathbf^i = (q_j \mathbf^j)\cdot \mathbf^i = (\mathbf^j\cdot\mathbf^i) q_j . If the basis vectors are
orthonormal In linear algebra, two vectors in an inner product space are orthonormal if they are orthogonal (or perpendicular along a line) unit vectors. A set of vectors form an orthonormal set if all vectors in the set are mutually orthogonal and all of un ...
, then they are the same as the dual basis vectors.


General Euclidean spaces

More generally, in an ''n''-dimensional Euclidean space ''V'', if a basis is :\mathbf_1,\dots,\mathbf_n, the reciprocal basis is given by (double indices are summed over), :\mathbf^i=g^\mathbf_j where the coefficients ''g''''ij'' are the entries of the inverse matrix of :g_ = \mathbf_i\cdot\mathbf_j . Indeed, we then have :\mathbf^i\cdot\mathbf_k=g^\mathbf_j\cdot\mathbf_k=g^g_ = \delta^i_k . The covariant and contravariant components of any vector : \mathbf = q_i \mathbf^i = q^i \mathbf_i \, are related as above by :q_i = \mathbf\cdot \mathbf_i = (q^j \mathbf_j)\cdot \mathbf_i = q^jg_ and :q^i = \mathbf\cdot \mathbf^i = (q_j\mathbf^j)\cdot \mathbf^i = q_jg^ .


Informal usage

In the field of
physics Physics is the natural science that studies matter, its fundamental constituents, its motion and behavior through space and time, and the related entities of energy and force. "Physical science is that department of knowledge which r ...
, the
adjective In linguistics, an adjective (abbreviated ) is a word that generally modifies a noun or noun phrase or describes its referent. Its semantic role is to change information given by the noun. Traditionally, adjectives were considered one of the ma ...
covariant is often used informally as a synonym for invariant. For example, the
Schrödinger equation The Schrödinger equation is a linear partial differential equation that governs the wave function of a quantum-mechanical system. It is a key result in quantum mechanics, and its discovery was a significant landmark in the development of th ...
does not keep its written form under the coordinate transformations of
special relativity In physics, the special theory of relativity, or special relativity for short, is a scientific theory regarding the relationship between space and time. In Albert Einstein's original treatment, the theory is based on two postulates: # The laws ...
. Thus, a physicist might say that the Schrödinger equation is ''not covariant''. In contrast, the
Klein–Gordon equation The Klein–Gordon equation (Klein–Fock–Gordon equation or sometimes Klein–Gordon–Fock equation) is a relativistic wave equation, related to the Schrödinger equation. It is second-order in space and time and manifestly Lorentz-covariant ...
and the
Dirac equation In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin- massive particles, called "Dirac par ...
do keep their written form under these coordinate transformations. Thus, a physicist might say that these equations are ''covariant''. Despite this usage of "covariant", it is more accurate to say that the Klein–Gordon and Dirac equations are invariant, and that the Schrödinger equation is not invariant. Additionally, to remove ambiguity, the transformation by which the invariance is evaluated should be indicated. Because the components of vectors are contravariant and those of covectors are covariant, the vectors themselves are often referred to as being contravariant and the covectors as covariant.


Use in tensor analysis

The distinction between covariance and contravariance is particularly important for computations with
tensor In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tensor ...
s, which often have mixed variance. This means that they have both covariant and contravariant components, or both vector and covector components. The valence of a tensor is the number of variant and covariant terms, and in Einstein notation, covariant components have lower indices, while contravariant components have upper indices. The duality between covariance and contravariance intervenes whenever a vector or tensor quantity is represented by its components, although modern differential geometry uses more sophisticated index-free methods to represent tensors. In
tensor analysis In mathematics and physics, a tensor field assigns a tensor to each point of a mathematical space (typically a Euclidean space or manifold). Tensor fields are used in differential geometry, algebraic geometry, general relativity, in the analysis ...
, a covariant vector varies more or less reciprocally to a corresponding contravariant vector. Expressions for lengths, areas and volumes of objects in the vector space can then be given in terms of tensors with covariant and contravariant indices. Under simple expansions and contractions of the coordinates, the reciprocity is exact; under affine transformations the components of a vector intermingle on going between covariant and contravariant expression. On a manifold, a
tensor field In mathematics and physics, a tensor field assigns a tensor to each point of a mathematical space (typically a Euclidean space or manifold). Tensor fields are used in differential geometry, algebraic geometry, general relativity, in the analysis ...
will typically have multiple, upper and lower indices, where Einstein notation is widely used. When the manifold is equipped with a
metric Metric or metrical may refer to: * Metric system, an internationally adopted decimal system of measurement * An adjective indicating relation to measurement in general, or a noun describing a specific type of measurement Mathematics In mathem ...
, covariant and contravariant indices become very closely related to one another. Contravariant indices can be turned into covariant indices by contracting with the metric tensor. The reverse is possible by contracting with the (matrix) inverse of the metric tensor. Note that in general, no such relation exists in spaces not endowed with a metric tensor. Furthermore, from a more abstract standpoint, a tensor is simply "there" and its components of either kind are only calculational artifacts whose values depend on the chosen coordinates. The explanation in geometric terms is that a general tensor will have contravariant indices as well as covariant indices, because it has parts that live in the
tangent bundle In differential geometry, the tangent bundle of a differentiable manifold M is a manifold TM which assembles all the tangent vectors in M . As a set, it is given by the disjoint unionThe disjoint union ensures that for any two points and of ...
as well as the cotangent bundle. A contravariant vector is one which transforms like \frac, where x^ \! are the coordinates of a particle at its
proper time In relativity, proper time (from Latin, meaning ''own time'') along a timelike world line is defined as the time as measured by a clock following that line. It is thus independent of coordinates, and is a Lorentz scalar. The proper time interval ...
\tau. A covariant vector is one which transforms like \frac, where \varphi is a scalar field.


Algebra and geometry

In category theory, there are
covariant functor In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) are associated to topological spaces, and m ...
s and
contravariant functor In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects (such as the fundamental group) are associated to topological spaces, and ...
s. The assignment of the dual space to a vector space is a standard example of a contravariant functor. Some constructions of
multilinear algebra Multilinear algebra is a subfield of mathematics that extends the methods of linear algebra. Just as linear algebra is built on the concept of a vector and develops the theory of vector spaces, multilinear algebra builds on the concepts of ''p' ...
are of "mixed" variance, which prevents them from being functors. In differential geometry, the components of a vector relative to a basis of the
tangent bundle In differential geometry, the tangent bundle of a differentiable manifold M is a manifold TM which assembles all the tangent vectors in M . As a set, it is given by the disjoint unionThe disjoint union ensures that for any two points and of ...
are covariant if they change with the same linear transformation as a change of basis. They are contravariant if they change by the inverse transformation. This is sometimes a source of confusion for two distinct but related reasons. The first is that vectors whose components are covariant (called covectors or
1-form In differential geometry, a one-form on a differentiable manifold is a smooth section of the cotangent bundle. Equivalently, a one-form on a manifold M is a smooth mapping of the total space of the tangent bundle of M to \R whose restriction to ...
s) actually
pull back In mathematics, a pullback is either of two different, but related processes: precomposition and fiber-product. Its dual is a pushforward. Precomposition Precomposition with a function probably provides the most elementary notion of pullback: i ...
under smooth functions, meaning that the operation assigning the space of covectors to a smooth manifold is actually a ''contravariant'' functor. Likewise, vectors whose components are contravariant push forward under smooth mappings, so the operation assigning the space of (contravariant) vectors to a smooth manifold is a ''covariant'' functor. Secondly, in the classical approach to differential geometry, it is not bases of the tangent bundle that are the most primitive object, but rather changes in the coordinate system. Vectors with contravariant components transform in the same way as changes in the coordinates (because these actually change oppositely to the induced change of basis). Likewise, vectors with covariant components transform in the opposite way as changes in the coordinates.


See also

*
Active and passive transformation Active may refer to: Music * ''Active'' (album), a 1992 album by Casiopea * Active Records, a record label Ships * ''Active'' (ship), several commercial ships by that name * HMS ''Active'', the name of various ships of the British Royal ...
*
Mixed tensor In tensor analysis, a mixed tensor is a tensor which is neither strictly covariant nor strictly contravariant; at least one of the indices of a mixed tensor will be a subscript (covariant) and at least one of the indices will be a superscript ( ...
*
Two-point tensor Two-point tensors, or double vectors, are tensor-like quantities which transform as Euclidean vectors with respect to each of their indices. They are used in continuum mechanics to transform between reference ("material") and present ("configurat ...
, a generalization allowing indices to reference multiple vector bases


Notes


Citations


References

* . * . * . * . * . * .


External links

* * * *
Invariance, Contravariance, and Covariance
* {{tensors Tensors Differential geometry Riemannian geometry Vectors (mathematics and physics)